Transforming Multiple-record Data into Single-record Format When Number of Variables Is Large

Authors

  • David Izrael
  • David Russo
Abstract

In one large survey project, each record in the raw data set contains information from one survey form. At some point, "denormalization" of the data needs to be done, transposing a multiple-record-per-object data set to a one-record-per-object data set that retains all raw variables from all data sources. The large dimension of the original data set requires an approach that can: a) carry out the denormalization in a reasonable amount of time; b) handle the indexing and standardization of variables' names automatically; c) easily adapt as the set of variables changes over time. One approach uses a two-step PROC TRANSPOSE. This approach deals elegantly with issues b) and c), but it takes a substantial amount of time to run. The second approach uses a driving macro based on the RETAIN statement. Although this method requires some maintenance, it runs much more quickly than the TRANSPOSE method.
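
The denormalization described above can be illustrated with a short sketch. This is written in Python, not SAS, and is not the authors' code: it only shows the general idea of making one pass over records grouped by object and accumulating them into a single wide record with automatically indexed variable names. The field names `id`, `q1`, `q2` and the `name_index` naming pattern are hypothetical.

```python
from itertools import groupby
from operator import itemgetter


def denormalize(rows, key="id"):
    """Collapse many records per object into one wide record per object.

    Each raw variable gets a numeric suffix for the record it came from
    (q1_1, q1_2, ...), so added variables need no manual renaming.
    """
    wide = []
    # groupby needs the rows ordered by the key, much like BY-group
    # processing requires sorted input. sorted() is stable, so the
    # original order of records within each object is preserved.
    ordered = sorted(rows, key=itemgetter(key))
    for obj_id, group in groupby(ordered, key=itemgetter(key)):
        record = {key: obj_id}
        for index, row in enumerate(group, start=1):
            for name, value in row.items():
                if name != key:
                    record[f"{name}_{index}"] = value
        wide.append(record)
    return wide


# Hypothetical input: two survey forms for object 1, one for object 2.
forms = [
    {"id": 1, "q1": "yes", "q2": 3},
    {"id": 1, "q1": "no", "q2": 5},
    {"id": 2, "q1": "yes", "q2": 4},
]
result = denormalize(forms)
```

Because the suffixed names are generated from whatever variables appear in each record, the sketch adapts when the set of variables changes, which corresponds to requirements b) and c) in the abstract. It says nothing about the relative run times of the two SAS approaches.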

Similar resources

Probabilistic Linkage of Persian Record with Missing Data

Extended Abstract. When the comprehensive information about a topic is scattered among two or more data sets, using only one of those data sets would lead to information loss available in other data sets. Hence, it is necessary to integrate scattered information to a comprehensive unique data set. On the other hand, sometimes we are interested in recognition of duplications in a data set. The i...

Asymptotic Efficiencies of the MLE Based on Bivariate Record Values from Bivariate Normal Distribution

Abstract. Maximum likelihood (ML) estimation based on bivariate record data is considered as the general inference problem. Assume that the process of observing k records is repeated m times, independently. The asymptotic properties including consistency and asymptotic normality of the Maximum Likelihood (ML) estimates of parameters of the underlying distribution is then established, when m is ...

THE CAPABILITY OF OPTIMAL SINGLE AND MULTIPLE TUNED MASS DAMPERS UNDER MULTIPLE EARTHQUAKES

The main focus of this research has been to investigate the effectiveness of optimal single and multiple Tuned Mass Dampers (TMDs) under different ground motions as well as to develop a procedure for designing TMD and MTMDs to be effective under multiple records. To determine the parameters of TMD and MTMDs under multiple records various scenarios have been suggested and their efficiency has be...

On Moments of the Concomitants of Classic Record Values and Nonparametric Upper Bounds for the Mean under the Farlie-Gumbel-Morgenstern Model

In a sequence of random variables, record values are observations that exceed or fall below the current extreme value. Now consider a sequence of pairwise random variables {(Xi, Yi), i >= 1}, when the experimenter is interested in studying just the sequence of records of the first component, the second component associated with a record value of the first one is termed the concomitant of that ...

A Statistical Analysis of the Aircraft Landing Process

Managing operations of the aircraft approach process and analyzing runway landing capacity, utilization and related risks require detailed insight into the stochastic characteristics of the process. These characteristics can be represented by probability distributions. The focus of this study is analyzing landings on a runway operating independent of other runways making it as a single runway. ...

Publication date: 1998